On Robustness/Performance Tradeoffs in Linear Programming and Markov Decision Processes
نویسندگان
چکیده
Computation of a satisfactory policy for a decision problem when the parameters of the model are uncertain is a problem encountered in many applications. The traditional robust approach is based on a worst-case analysis and may lead to overly conservative solutions. In this paper we directly quantify the robustness to uncertainty and consider the tradeoff between the nominal performance and robustness measures. Optimization in both linear programming and Markov decision processes is discussed. For linear programming we consider the tradeoff between the nominal cost of a solution and a robustness measure that quantifies the magnitude of constraint violation under the most adversarial parameters. We propose an algorithm that computes the whole set of Pareto efficient solutions based on parametric linear programming. For Markov decision processes, we consider the tradeoff between the performance under nominal parameters and the performance under adversarial parameters. For the special case where only the rewards are uncertain, we propose an algorithm that computes the whole set of Pareto efficient policies in a single pass.
منابع مشابه
Robustness in portfolio optimization based on minimax regret approach
Portfolio optimization is one of the most important issues for effective and economic investment. There is plenty of research in the literature addressing this issue. Most of these pieces of research attempt to make the Markowitz’s primary portfolio selection model more realistic or seek to solve the model for obtaining fairly optimum portfolios. An efficient frontier in the ...
متن کاملThe Robustness-Performance Tradeoff in Markov Decision Processes
Computation of a satisfactory control policy for a Markov decision process when the parameters of the model are not exactly known is a problem encountered in many practical applications. The traditional robust approach is based on a worstcase analysis and may lead to an overly conservative policy. In this paper we consider the tradeoff between nominal performance and the worst case performance ...
متن کاملPerformance Analysis of Dynamic and Static Facility Layouts in a Stochastic Environment
In this paper, to cope with the stochastic dynamic (or multi-period) problem, two new quadratic assignment-based mathematical models corresponding to the dynamic and static approaches are developed. The product demands are presumed to be dependent uncertain variables with normal distribution having known expectation, variance, and covariance that change from one period to the next one, randomly...
متن کاملApproximate Linear Programming for Constrained Partially Observable Markov Decision Processes
In many situations, it is desirable to optimize a sequence of decisions by maximizing a primary objective while respecting some constraints with respect to secondary objectives. Such problems can be naturally modeled as constrained partially observable Markov decision processes (CPOMDPs) when the environment is partially observable. In this work, we describe a technique based on approximate lin...
متن کاملAccelerated decomposition techniques for large discounted Markov decision processes
Many hierarchical techniques to solve large Markov decision processes (MDPs) are based on the partition of the state space into strongly connected components (SCCs) that can be classified into some levels. In each level, smaller problems named restricted MDPs are solved, and then these partial solutions are combined to obtain the global solution. In this paper, we first propose a novel algorith...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2007